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Section 1 Overview and Pin Functions 



1.1 Processor Features 

This is a single-chip RISC microprocessor thai integrates a RISC-type architecture CPU core and a 
single precision FPU (Floating Point Unit), an onchip multiplier, a cache memory, and a memory 
management unit as well as peripheral functions required for system configuration. The processor 
includes data protection and virtual memory functions. 

The Processor also contains the timer, a real time clock, an interrupt controller, and a serial 
communication interface as peripheral functions necessary for the system configuration. An 
external memory access support function enables direct connection to DRAM and SDRAM. The 
Processor microprocessor also supports a PCMCIA interface. 

A powerful built-in power management function keeps power consumption low, even during high- 
speed operation. The Processor can run at four times the frequency of the system bus operating 
speed, making it optimum for systems requiring both high speed and low power consumption. 

The features of the Processor are listed in table 1 . 1 . 
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Table 1.1 Processor Features 
Item Features 



CPU 



• Original RISC architecture — — — 

• 32-bit internal data paths 

• General-register files 

— Sixteen 32-bit general registers (eight 32-bit shadow registers) 

• RISC-type instruction set (upward compatibility with the series) 

— Instruction length: 16-bit fixed length for improved code efficiency 

— Load-store architecture 

— Delayed branch instructions 

— Instruction set based on C language 

• Instruction execution time: one cycle for basic instructions 

• Logical address space: 4 Gbytes (448-Mbytes actual memory space) 

• Space identifier ASID: 8 bits, 256 logical address spaces 

• Onchip multiplier 

• Five-stage pipeline 



generator # C,ock mode: selec *ed from an onchip oscillator module, a frequency-doubling 

circuit, or a clock output by combining them by PLL synchronization 

• Processing states: 

— Power-on reset state 

— Manual reset state 

— Exception processing state 

— Program execution state 
~ Power-down state 

— Bus-released state 

• Power-down modes: 

— Sleep mode 

— Standby mode 

— Module stop mode 
• Onchip clock pulse generator 
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Table 1.1 Processor Features (cont.) 



Item Features — •—— - . 

• Separate Pipeline for Floating Point Operation*; ~ ~ ~ 

• Sinole precision floati ng-point format is supported , 

• Subset of IEEE standard's data tvn* j s supporter^ 

Z Invalid Operation and nivisioh bv yarn exceptions are supported , (s yfasfll of 

'EEE standard ) 

• ROUndinn tO Zero is SUDDOrted fsuhsets of |EFF standarrj ) . 
» General-moister files 

— Sixteen 32-bit floating agisters, 

• FMAC f multinlv * Accumulate! |s su pported 
I FDIV/FSORT are s.innnrteH 

• FLDfO/FT on Oflad cojsiaiuQaiatfi suppaned 

! Instruction latency time- two cycles for FMArvFAnry p sUB/FMt II 
! Execution nitch ; one cvde for FMA(VPAn o/csi ir/fmi i| 
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Table 1.1 Processor Features (cont) 
Item Features 



Memory 

management 

unit 



Cache memory 



Interrupt 
controller 



User break 
controller 



• 4 Gbytes of address space, 256 address spaces (ASID 8 bits) 

• Page unit sharing 

• Supports multiple page sizes: 1, 4 Kbytes 

• 128-entry, 4-way set associative TLB 

• Supports software selection of replacement method and random-replacement 
algorithms / ' • 

« Contents of TLB are directly accessible b y address mapping 

• 8-Kbytes, unified instruction/data — — — . _ 

• 128 entries, 4-way set associative, 16-byte block length 

• Write-back/write-through, LRU replacement algorithm 

• 1 -stage write-back buffer 

• Contents of cache memory can be accessed directly by address mapping (can 
be used as onchip memory) 

• 17 levels of external interrupt (NMI, IRQ), external interrupt pins (NMI 
IRL3-IRL0) ' 

• Onchip peripheral interrupts: set priority levels for each module 

• Supports debugging by user break interrupts 

• 2 break channels — 

• Addresses, data values, type of access, and data size can all be set as break . 
conditions 

• Supports a sequential break function 



A5 



Table 1.1 Processor Features (cont.) 



Item 


Features 


Bus state 


• Supports external memory access 


controller 


— 32/16-bit external data bus 




• Physical address space divided into seven areas, each a maximum 64 




Mbytes, with the following features settable for each area: 




— Bus size M 6 or 32 bits) 




— fNumoer 01 wan cycles ^cuou supports a naruwaro wan luncuunj 




Qattinrt tha tuno nf cnarfi PHAhloc Hirar't mnnfw*ttnn tA HR AM ^HR AM 
— OtJuiny uio typo vji spauo ci lauiao uuaui uuiiito^-uuii iu \j r\/"\ivi , sju> ri^ivi. 




and hurst ROM 




— Supports PCMCIA 




— Outputs chip select signal (CS0-CS6) for corresponding area 




• DRAM/^nRAM refresh function 
w lj n/A jvi/ %j L/ ti^ivi lonooii iuii^uvjii 




. — Programmable refresh interval 




— ^unnnrt 4 * CAS-before-RAS refresh and self-refresh rnodes 




— ouppons power-aown un/Mvi 




« HO AJUI/QDQ AIL4 Kiiret a^vocc ftinr>tinn 
• U itMlVl/ OLy riM/Vl UUlol aCCSSS iunCUOU 




• 32bit MultiDlex SRAM i address and data muftiDlexed} burst access function 




• Switchable between big and Irttle Endian 


Timer 


• 3-channel auto-reload type 32-bit timer 




• Input capture function 




• 6 types of counter input clocks can be selected 




• Maximum resolution: 2 MHz 


Real time clock 


• Built-in clock and calendar functions 




• Onchip 32-kHz crystal oscillator circuit with a maximum resolution (cycle 




interrupt) of 1/256 second 


Serial communi-" 


• Select start-stop sync mode or clock sync system 


cation interface 


• Full-duplex communication 




• Supports smart card interface 


Package 


• 1 44-pin plastic QFP (FP-1 44A) 



1.2 Block Diagram 

Figure 1.1 is a functional block diagram of the Processor. 
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Figure 1.1 Processor Functional Block Diagram 
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1.3 Pin Description 



U.l Pin Arrangement 
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Figure 12 Pin Arrangement (144-Pin Plastic QFP) 
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U.2 Processor Pin Functions 



Table 1.2 Processor Pin Functions - 



No. 


Terminal 


I/O 


Description 


1 


D27 


I/O 


Data/Address* bus 


2 


D26 


I/O 


Data/Address, bus 


3 


D25 


I/O 


Data/Address* bus 


4 


D24 


I/O 


Data/ Address* bus 


5 


D23 


I/O 


Data/Address* bus 


6 


. GND 


Power 


Power (0 V) 


7 


vcc 


Power 


Power (3.3 V) 


8 


D22 


I/O 


Data/ Address* bus 


9 


D21 


I/O 


Data/Address* bus 


10 


020 


I/O 


Data/Address* bus 


11 


D19 


I/O 


Data/Address* bus 


12 


D18 


I/O 


Data/Address* bus 


13 


D17 


I/O 


Data/Addrass" bus 


14 


016 


I/O 


Data/Address* bus 


15 


D15 


I/O 


Data/Address* bus 


16 


D14 


I/O 


Data/AddreSS* bus 


17 


GND 


Power 


Power (0 V) 


18 


VCC 


Power 


Power (3.3 V) 


19 


GND 


Power 


Power (0 V) 


20 


VCC 


Power 


Power (3.3 V) 


21 


D13 


I/O 


Data/ Address* bus 


22 


012 


I/O 


Data/Address* bus 


23 


011 


I/O 


Data/Address* bus 


24 


D10 


I/O 


Data/Address* bus 


25 


09 ' 


I/O 


Data/Address* bus 


26 


08 


I/O 


Data/Address* bus 


27 


07 


I/O 


Data/Addre§5*. bus 


28 


06 


I/O 


Data/Address* bus 


29 


D5 


I/O 


Data/Address* bus 


* notes; Address is available antra MPX-SRAM interface only. 
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Table 1.2 Processor Pin Functions (cont.) 



nu. 


Tarmi nal 
. 1 ci 1 1 1 1 1 1 a 1 


I/O 


Description 


in 




Power 


Power (0 V) 


11 




Power 


rower (3.3 V) 


1? 


Da ' 




uata/Address bus 


11 
JO 


ni 


i/r\ 
UU 


Data/AcJdre§§ bus 


34 


HP 

L/C 




Uata/Address bus 




L/ I 




Uata/Address bus 


Ifi 

JO 


nn 


l/U 


Data/Addre§$ bus 


17 


An 


r\ 
\J 


Address bus 


38 


A1 


O 




39 




n 


Address bus 


40 


A3 




Address bus 


41 


GND 


Power 


rower (0 v) 


42 


VCC 


Power 


Mower (3.3 V) 


43 


A4 


n 


Address bus 


44 


A5 


n 


Address bus 


45 


A6 


r\ 
vJ 


Address bus 


46 


A7 


w 


Address bus 


47 


A8 


r\ 
\J 


Address bus 


48 


A9 ^ 




Address bus 


49 


GND 


Power 


Power (0 V) 


50 


VCC 


Power 


Power (3.3 V) 


51 




O 


Address bus 


52 


A11 


O 


Address bus 


53 


A12 


O 


Address bus 


54 


GND 


Power 


Power (0 V) 


55 


VCC 


Power 


Power (3:3 V) 


56 


A13 


O 


Address bus 


57 


A14 


0 


Address bus 


58 


A15 


O 


Address bus 


59 


GND 


Power 


Power (0 V) 


60 . 


VCC 


Power 


Power (3.3 V) 



• POtes; Address is available at the MPX-S RAM rnterfa™ only 
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Table 1.2 Processor Pin Functions (cont.) 




No. 


Terminal 


I/O 


Description 


61 


A16 


o 


Address bus 


62 


A17 


o 


Address bus 


. 63 . 


. A18 


o 


Address bus 


64 


A19 


o 


Address bus 


65 


A20 


o 


Address bus 


66 


A21 


o 


Address bus 


67 


A22 


o 


Address bus 


68 


GND 


Power 


Power (0 V) 


69 


VCC 


Power 


Power (3.3 V) 


70 


A23 


0 


Address bus 


71 


A24 


O 


Address bus 


72 


A25 


O 


Address bus 


73 


GND(PLU) 


Power 


Power (0 V) for onchip PLL 


74 


CAP1 


0 . 


External capacitance pin for PLL 


75 


VCC(PLU) 


Power 


Power (3.3 V) for onchip PLL 


76 


GND(PLL2) 


Power 


Power (0 V) for onchip PLL 


77 


CAP2 


0 


External capacitance pin for PLL 


78 


VCC(PLL2) 


Power 


Power (3.3 V) for onchip PLL 


79 


EXTAL 


I 


External clock/crystal oscillator pin 


80 


XTAL. 


O 


Crystal oscillator pin 


81 




I 


Pull this pin up 


82 


GND 


Power 


Power (0V) 


83 


VCC 


Power 


Power (3.3 V) 


84 


MD2/RXD 


I 


Operating mode pin/serial data input 


85 


MD1/TXD 


I/O 


Operating mode pin/serial data output 


86 


MDO/SCK 


I/O 


Operating mode pin/serial clock 


87 


BREQ 


I 


Bus request 


88 


RESET 


I 


Reset 


89 


MM I 


I 


Nonmaskable interrupt request 


90 


IRL3 


I 


External interrupt source input 


91 


IRL2 


I 


External interrupt source input 



All 



Table 1.2 Processor Pin Functions (cont.) 



No. 


Terminal 


I/O 


Description 




92 


IRL1 


I 


External interrupt source input 




93 


IRLO 


I 


External interrupt source input 




94 


IOIS16 


I 


101 6-bit instruction 





95 


IRQOUT 


o 


Interrupt request notification 




96 


BACK 


o 


Bus acknowledge 




97 


STATUS 1 


o 


Processor status 




98 ^ 


STATUS0 


0 


Processor status 




99 


CPACK 


o 


Clock pause acknowledge 




100 


GND 


Power 


Power (0 V) 




101 . 


CKIO 


I/O 


System clock I/O 




102 


VCC 


Power 


Power (3.3 V) 




103 


MD4/CE2B 


I/O 


Operating mode pin/PCMCIA CE pin 




104 


MD3/CE2A 


I/O 


Operating mode pin/PCMCIA CE pin 





105 


BS 


O 


Bus cycle start 




106 


RD/WR 


O 


Read/write 




107 


RO/FRAME 


O 


Read pulse/prame. 




108 


CS6/CE1B 


O 


Chip select 6/PCMCIA CE pin 




109 


CS5/CE1A 


O 


Chip select 5/PCMCIA CE pin 


— : 


110 


CS4 


O 


Chip select 4 




111 


CS3 


O 


Chip select 3 




112 


CS2 


O 


Chip select 2 




113 


csr 


O 


Chip select 1 




114 


cso / 


O 


Chip select 0 




115 


GND 


Power 


Power (0 V) 




116 


VCC 


Power 


Power (3.3 V) 




117 


WE3/DQMUU/ICIOWR 


O 


D3 1-024 selection signal/IO write 




118 


WE2/DQMUL/ICIORD 


O 


D23-01 6 selection signal/IO read 




119 


CASHH/CAS2H 


O 


D31-024/D15-08 selection signal 




120 


CASHL/CAS2L 


O 


D23-D1 6/D7-D0 selection signal 




121 


GND 


Power 


Power (0 V) 




122 


VCC 


Power 


Power (3.3 V) 
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Table 1.2 Processor Pin Functions (cont.) 



No. Terminal I/O Description 



123 


WE1/0QMLU 


0 


D15-D8 selection signal 


124 


WEO/DQMLL 


0 


D7-D0 selection signal 


125 


CASLH 


0 


D15-D8 selection signal 


126 


CASLL7CAS 


0 


D7-D0 selection/memory selection signal 


127 


GND 


Power 


Power (0 V) 


128 


vec 


Power 


Power (3.3 V) 


129 


RAS 


0 


RAS for DRAM 


130 


MD5/RAS2 


I/O 


Operating mode pin/RAS for DRAM 


131 


CKE 


0 


Clock enable control for SDRAM 


132 


WAIT 


I 


Hardware wait request 


133 


GND 


Power 


Power (0 V) 


134 


JCLK 


I/O 


Clock I/O for TMU/RTC 


135 


VCC (RTC) 


Power 


Power for RTC (3.3 V) 


136 


XTAL2 


0 


Crystal oscillator pin for onchip RTC 


137 


EXTAL2 


1 


Crystal oscillator pin for onchip RTC 


138 


GND (RTC) 


Power 


Power for RTC (0 V) 


139 


VCC 


Power 


Power (3.3 V) 


140 


D31 


I/O 


Data/Address* hn* 


141 


D30 


I/O 


DataZAddiasalbus 


142 


D29 


I/O 


DataMddr§salbus 


143 


D28 


I/O 


Data^Addiasalbus 


144 


GND 


Power 


Power (0 V) 


* notes: \dd T *n jg available at the MPX-SP AVf intend only 
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Section 2 Programming Model of CPU and FPU 



2.1 Programming Model 

The Processor operates in user mode under normal conditions and enters privileged mode in 
response to an exception. Mode is specified by the mode bit (MD) in the status register. The 
registers accessible to the programmer differ, depending on the processor mode. 

General-purpose registers RO to R7 are banked registervwhich are switched by a processor mode 
change. In privileged mode (MD=l ), the register bank (RB) bit defines which banked register set 
is accessed as general-purpose registers, and which set is accessed only through the load control 
register (LDC) and store control register (STC) instructions. 

When the RB bit is a logic one, B ANK1 general-purpose registers R0-R7_BANK1 and non- 
banked general-purpose registers R8-R 15 function as the general-purpose register set, with 
B ANKO general-purpose registers R0-R7_BANK0 accessed only by the LDC/STC instructions. 

When the RB bit is a logic zero, B ANKO general-puipose registers RO-R7_B ANKO and non- 
banked general-purpose registers R8-R15 function as the general -purpose register set, with 
B ANK1 general -purpose registers RO-R7_BANKl accessed only by the LDC/STC instructions. 
In user mode (MD=0) B ANKO general purpose registers R0-R7_BANK0 function as the general 
purpose register set regardless of register bank (RB) setting. The programming model for each 
processor mode is listed in figure 2. 1 and the registers are briefly defined in figures 2.2 and 2.3. 
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Notes: 1 



RO functions as an index register in the indexed register-indirect addressing mode 
and indexed GBR-indirect addressing mode. In some instructions, only RO can be 
used as the source register or destination register. 
2. R0-R7 are banked registers. In user mode, BANKO is used. In privileged mode 
- SR.RB specifies BANK. SR.RB = 0: BANKO is used. SR.RB = 1: BANK1 is used. 
1 These registers are only accessed by LDC/STC instructions. SR.RB specifies 
BANK. SR.RB = 0: BANKO is used. SR.RB = 1: 8ANK1 is used. 



Figure 2,1 Programming Model of CPU and FPU 
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General-Purpose Registers 

Notes; 

I . RO functions as an index register in the indexed 
register- indirect addressing mode and indexed 
GBR-indiresi addressing mode. In some msuuetions. 
only RO can be used as the source register or 
destmarioo register. 

2 R0-R7 are banked registers. Each of the two 
processor operation modes has us own R0-R7 register. 
In privileged mm ode, the RB bit in the SR defines which 
set | RO-R7JJANK0 cx R0-R7.BANK I) is accessed as 
general-purpose registers. 



Floating point Registers 

Notes: 

1. FO functions as the multiplied values for 
FMACu 



System Registers 

Multiply and Accumulate High and Low Registers(MACH/L) 
S tore (he results of m duplication and accumulation 
opearatjom. 

FPU Low FUgistsr (FPUL) 

Use the buffer as (he cammaucaooo register between CPU and FPU. 
Procedure Register (PR) 

Stores the retuea address for exiting subroutine procedures. 



31 



TT 



FPSCR 



Program Counter (PQ 

Indicates the starting address of (he current instruct wo 
incremented by four. . . 

Floating Point Starus/Control Register (FPU SCR) 
Indicates the Status or control information for Boating point. 
( jnore detail, see Section 16) 



Figure 2.2 Register Set Overview, GPRs, FPRs and System Registers 
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Saved Status Regster (SSR) 

Stores current SR value at 1meof ecepfan to irdcate 
processor status in tie return to instruclon stream tocm 
except cn harder. 

Saved Program Counter (SPC) 

Stores current PC value attmeof ewceplcn to irdcate 

return address at ccmpielcn of acoepticn processing. 

Global Base Register (GBR) 

Stores ihe base address of tie G BR-ind rect addressing 

mode. The Q BR-tnd reot addressing mode is used *> 

transfer data to the regster areas of tie resident 
peripheral moddes, and farlogc operations. 

factor Base Register (VBR) 

Stores the base address of tie except an processing 

vector area. 

10 9 8 7 3 10^ 

Status 

Regster (SR) 



EEEEE3E0EQ 



T bit: The MOVT, CMP/cond FCMP/cond. TAS. TST, BT, BF. SETT, CLRT, and DT 
instructions use the T bit to indicate true (logic one) or false (logic zero). The 
ADDV/C, SUBV/C, DIVOU/S, DIV1, NEGC, SHAR/L, SHLR/L, ROTR/L, and RO 
TCR/L instructions also use the T bit to indicate a carry, borrow, overflow, or 
underflow. 

S bit: Used by the MAC instruction. 
Zero bits: Always read as 0, and should always be written as 0. 
I MASK: 4-bit field indicating the interrupt request mask level. 
M and Q bits: Used by the DIVOU/S and DIV1 instructions. 

RB: Register bank bit: defines the general-purpose registers in privileged mode. A 
logic one designates RO-R7J3ANK1 and R8-R1 5 are accessed as general- 
purpose registers, and RO-R7J3ANKO are only accessed by LDC/STC 
instructions; a logic zero designates R0-R7_BANK0 and R8-R1 5 are accessed 
as general-purpose registers, and R0-R7_BANK1 are only accessed by 
LDC/STC instructions. 

BL: Block bit: masks exceptions in privileged mode as follows: 

BL = 1, exceptions are masked (not accepted); BL = 0, exceptions are accepted 
MD: Processor operation mode bit: indicates the processor operation mode as 
follows: 1 = Privileged mode; 0 = User mode 
Note: Only the M, Q, S, and T bits are read or written from user mode. All other bits are read 
<5r written from privileged mode. 



Figure 23 Register Set Overview, Control Registers 
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Section 4 Exception Processing 

4.1 Overview 

Exceptions are deviations from normal program execution that require special'handUne The 
pressor responds to an exception by aborting execution of the current instruction (execution is 
* owed to continue to completion in all interrupt requests) and passing contro. from the 
instruction stream to the appropriate user-written exception handling routine. 

^SiST" ° f T Md SR " S3Ved m -""V*** (SPC) and saved status 

ZZ T' reSpe f CUVely ' Md executl0n ° f * e ^Ption handler is invoked from a vector 
location. The return from exception handler (RTE) instruction is issued by the excepdon handler 
rouune at the completion of the rouune, restoring the contents of the PC and SR o Tec ov« *e 
.nstrucuon stream arid the processor status from the point of interruption. 

^oTtoZ^ST f0Ur ,0Cati ° DS - HXed PhySiCal addrCSS H ' A000 ^ « dedicated as 
° ? P , U: 0thCr CVentS "* ""^ offsets ^ ■ vector table pointed to by a 
software-designated vector table base. The types of exception events assigned ofltoTtoS vecfoJ 
table consist of translation lookaside buffer (TLB) miss (H'OOOOmom ~i 7 
(UmncYVfYw ,„h > ^ wwwuo), general interrupt requests 

alexcepuon events inch.flinp FPU ration trip other than TLB miss 

nSki "h!SS 1116 VCCt0r ab ' e baSC iS ,0aded int0 ■* vector base register 

(VZe 4 ,) VCCt0r bMftSh0UUi feSide " P1 ° r K ^ ^ space 

A basic exception processing sequence consists of the following operations: 

• The contents of PC and SR are saved in SPC and SSR, respectively 

• ^e block (BL) bit in SR is set to a logic one, masking any subsequent exceptions 

• The mode (MD) bit in SR is set to a logic one to place the Processor in privileged mode 

• The register bank (RB) bit in SR is set to a logic one 

' e^enT(E^V^„?^ n8 "T£ ™"* bits 1 ^ - the exception 

event (fcXPEVT) or interrupt event (INTEVT) register. 

• Instruction execution jumps to the designated vector location to invoke the handler routine. 
If a general exception event is delected when the* m hit in cd « i ■ 

m«i~n , 8 8 Deral ,menupt rw l uest is detected *hen BL = 1. the request is 

masked (hejd pending) and not accepted until the BL bit is cleared to a logic zero by softie 

eXCePti ° D Pr0CeSSin8, SPC ^ SSR mUSt * ^ * BL "it in SR cleared to a 
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Processor resets and interrupts are asynchronous events unrelated to the instruction stream. All 
exception events are prioritized to establish an acceptance order whenever two or more exception 
events occur simultaneously (the power-on reset and manual restart reset are mutually exclusive 
events). All general exception events occur in a relative order in the execution sequence of an 
instruction (i.e., execution order), but are handled at priority level 2 in instruction-stream order 
(i.e., program order), where an exception detected in a preceding instruction is accepted prior to an 
exception detected in a subsequent instruction (figure 4.1). 

Three general exception events (reserved instruction exception, unconditional trap, and illegal slot 
instruction exception) are detected in the decode stage of different instructions and are mutually 
exclusive events in the instruction pipeline. In table 5.1, the exception events that trap to a vector 
location are listed by exception type, instruction completion status, relative priority of acceptance, 
relative order of occurrence within an instruction execution sequence, and vector location, the 
exception codes written into bits 1 1-0 of the EXPEVT register (for reset or general exception 
events) or the INTEVT register (for general interrupt requests) to identify each specific exception 
event are defined in table 4.2. 

An additional exception register, the TRA register, is used to hold the 8-bit immediate data in an 
unconditional trap (TRAPA instruction). The bit configurations of the EXPEVT, INTEVT, and 
TRA registers are diagrammed in figure 4.2. 
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Pipeline Sequence: 



Instruction n 

Instruction n + 1 

Instruction n + 2 
Detection Order 



A TLB miss (data access) 



A TLB miss (instruction access; 



A RIEXreserved instruction exception) 



TLB miss (instruction n+1) 

TLB miss (instruction n) and RIE (instruction n + 2) = simultaneous detection 



Handling Order: 



Program Order 



\ 



TLB miss (instruction n) 
Reexecution of instruction n 
TLB miss (instruction n + 1) 

I 

Reexecution of instruction n + 1 / 

\ 

RIE (instruction n + 2) 



I = Instruction fetch 
D = Instruction decode 
E = Instruction execution 
A = Memory access 
S = Result store 



.figure 4. 1 Example of Acceptance Order of General Exception Events 
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Table 4. 1 Vectored Exception Events 



Exception 
Type 


Current 

Instruction Exception Event 


Priority 


Exception 
Order 


Vector 


Vector 

OffftAt 

v-/n»ex 


Reset 


Aborted Power-on 


1 




H'AOOOOOOOO 


I — 




Manual reset 


1 




H'AOOOOOOOO 


— 


General 
events 


Aborted Address error 
and retried ' (instruction access) 


2 . 


1 




H'00000100 




TLB miss (instruction 2 
access) 






H'00000400 




TLB invalidation 
(instruction access) 


2 


3 





H'00000100 




TLB protection 
violation 


2 


4 




H'000001 00 




(instruction access) 












neservea 

instruction exception 


2 


5 


— • 


H'000001 00 




Illegal slot 

instruction exception 


2 


5 


— 


H'00000100 




. Address error 
(data access) 


2 


6 




n OOO001 00 




TLB miss 
(data access) 


2 


7 




n UUUUU4UU 




TLB invalidation 
(data access) 


2 


8 




H'nnnnm nn 




TLB protection 
violation 


2 


9 




H'00000100 




, (data access) 












FPU exception 


2 


10 




H'00000100 




Initial page write 


2 


11 




H'00000100 : 




Completed Unconditional trap 
. . (TRA instruction) 


2 


5 




H'00000100 




User breakpoint trap 


2 


n'2 




H'00000100 


General 
interrupt 


Completed Nonmaskable 
interrupt 


3 






H'00000600 


requests 

■ 


External hardware 
interrupt 


4-3 






H'00000600 




Peripheral module 
interrupt 


4-3 






H'oboooeoo 
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Table 4.2 EXPEVT Register and INTEVT Register Exception Codes 



Exception Type 


Exception Event 




Reset 


Power-on 






Manual reset 


n \jc.\j 


General exception events 


TLB miss (load) 






TLB miss (store) 


H'060 




initial page write 


H'080 




TLB protection violation '(load) 


H'OAO 




TLB protection violation (store) 


H'OCO 




Address error (load) 


H'OEO 




Address error (store) 


H'100 




FPU exception 


H'120 




Unconditional trap (TRA instruction) 


rf160 




Reserved instruction exception 


H'180 




Illegal slot instruction exception 


H'lAO 




User breakpoint trap 


KIEO 


Note: ExceotioncodeH'140is reserved 
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Table 4.2 EXPEVT Register and ENTEVT Register Exception Codes (cont) 

Exception Type Exception Event Exception Code 

General interrupt requests Nonmaskable interrupt H'1C0 

External hardware interrupt: 



IRL0-IRL3 = 0000 H'200 



IRLo-IRL 3 = 0001 


H'220 


IRLo-IRLa = 0010 


H'240 


IRLo-IRL 3 = 0011 / 


H'260 


.IRLo-IRL 3 = 0100 


H'280 


IRLo-IRL 3 = 0101 


H'2A0 


IR1ht*'RL 3 = 0110 


H'2C0 


1RLo-IRL 3 = 0111 


K2E0 


IRLo-IRL 3 = 1000 


H'300 


IRL <r IRL 3 = 1001 


H'320 


IRU-IRL 3 = 1010 


HT340 


IRLo-IRL 3 = 1011 


H l 360 


IRLq-IRL 3 = 0100 


K380 


lRLo-IRL 3 = 1101 


HT3A0 


IRLo-IR^^ 1110 


H'3C0 


IRLcrlRL 3 = 1111 


H'3E0 



Peripheral module interrupts H'4xx / H'Sxx 



EXPEVT register and iNTEVT register 
31 11 0 


TRA register 
31 




9 2 


0 




0 -0 


Exception code 






n 


imm 


00 














0: Reserved bits, always read as zero., 
imm: 8-bit immediate data in TRAPA instruction. 











" Figure 4.2 Bit Configurations of EXPEVT, INTEVT, and TRA Registers 



All interrupts are detected in the decode stage of the instruction pipeline and serviced on 
instruction boundaries (i.e., the instruction in which the interrupt occurs is allowed to continue to 
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completion, and upon returning from the interrupt handler, the instruction stream is resumed from 
the instruction following the instruction in which the interrupt occuned). However, interrupt 
requests are not accepted in any instruction executed between a delayed branch instruction and the 
instruction executed in the delay slot. 

When a general exception event is detected in a delay slot, the return address saved in SPC is the 
address of the related delayed branch instruction rather than that of the instruction in which the 
exception was detected. The illegal slot exception results whenever an attempt is made to execute 
a TRAPA instruction or certain branch operations in a delay slot. Such branch operations include 
the JMP, JSR. BRA, BRAF, BSR, BSRF, RTS, RTE, BT..BT/S, BF, and BF/S instructions 
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16. Floating Point Architecture 

16.1 Introduction 

This chapter. describes the Instruction Set Architecture of the FPU Floating Point Unit. This 
architecture has been defined to be compatible with future generation of this architecture. FPU 
provides a limited set of floating point instructions with the objective of supporting graphics 
processing in video games. FPU supports single precision floating point operations and FPU 
will emulate double precision floating point operations to support the full PC API system 
software. The FPU specification described here is a proper subset of a complete architecture 
for IEEE 754 floating point architecture. 

16.2 Floating Point Format 
16.1.1 Floating Point Format 



31 30 



23 22 



f 



A floating point number contains three fields: a sign,. 5, an exponent, e, and a fraction,/ The exponent is biased, that is, it 
is of the form e = E + bias. The range of the unbiased exponent E runs from E fT ^ n A to E max +1. Two values are 
distinguished, £ m rt - I. which flags zero (both positive and negative) and denonnalized numbers, and E max + I, which 
flags positive and negative infinity and NaNs (Not a Number). For single precision, the bias is 127, E min is -126 and 



is 127. 



The value of a floating point number, v, may be determined as follows: 



if E — Emax +1 and f != 0, then v is a NaN, regardless of s 



if E = Emax +1 and f = 0, then v = (-I)* (infinity) 



if Emin <=E <= Emax then v = (- 1 )* 2* ( 1 ./ ) 



if E = Emin - 1 and f != 0, then v = (- iy 2»™ (0./> 



if £ = Emin -1 and f = 0, then v = (-1)* 0 



16.1 J Not a Number (NaN) 

To represent a NaN for a single precision value, at least one of bit 22-0 must be set. Bit 22, if set, indicates a signaling 
NaN. If reset, the value is a quiet NaN. The bit pattern for NaN is shown in the Figure. The bit N is set for signaling NaNs 
and reset for quiet NaNs; x indicates a don't-care bit, but at least one of bit 22-0 is set. For NaNs, the sign bit is a don't 
care. 



31 30 



23 22 



11111111 



Nxxxxxxxxxxxxxxxxxxxxxx 



If the input to the operation generating a floating-point-value is an sNaN: 

a. tfte output is a qNaN if invalid operation exception bit in FPSCR is not enabled; 

b. raises invalid operation exception if invalid operation exception bit is enabled in FPSGR. In this case the destination 
of the operation is not changed. 

If the input of the operation generating a floating-point-value is a qNaN and no input of the operation is an sNaN, the 
output is always a qNaN irrespective of the setting of the invalid operation exception enable bit in FPSCR and will not 
cause any exception. 
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For another floating-point operation whose input is a NaN. please refer the individual operauon description. 

16.1.3 Denormalized Value 

i ^normalized rloatine pom. number ., represented with biased exponent to be 0. fraction pan to be non-zero and hidden 
iTo oto^ "numbers .source operand, or a result) are uniformly Rushed to zero by a va.ue-generating floating- 
point operation .the operation other than just a copy) in SH3E floating point unit. 

16.1.4 Other Special Values 

There are several distinguished values in the space of floating point values as shown in Table 1 . 



Value 


Representation 


+0.0 


oxoboooooo 


-0.0 


0x80000000 


Denormalized Value 


(as shown above) 


+INF. 


Ox7F800000 


-INF 


OxFF800000 


qNaN. quiet NaN 


(as shown above) 


sNaN. signaling NaN 


(as shown above) 



16J Floating Point Registers and System Registers for FPU 

16.3.1 Floating Point Register File 

The SH3E provides 16 32-bit single precision floating point registers. Register designators are always 4-biis. In assembly 
language the floaung point registers are designated as FRO, TV.. 7 32. and so forth. 

16.3.2 Floating Point Communication Register (FPUL) 

Information is transferred between the FPU and the CPU through a communication register. FPUL ™^ us 10 * e 
MACL and MACH registers of the integer unit. FPUL is considered as a system register, accessed on the CPU sioc oy 
LDS and STS instructions. FPUL is assigned address 0x03 in the systems register space. 

16.3.3 Floating Point Status / Control Register (FPSCR) 



31 


18 


12 


7 


2 0 


reserved 


DN 


cause 
EVZOUI 


enable 
VZOUI 


flag Irm 
VZOUI | 



The SH3E implements a floaiine point status and control register. FPSCR. as a system register accessed through the LDS 
and STS instructions. FPSCR is available for modification by the user program. The FPSCR is pan of the per process 
context and must be saved across context switches and may need to be saved across procedure calls. 

The FPSCR is a 32-bit register, which controls the FPU by controlling rounding, gradual underflow (denormalized 
values .. and caotures details" about floaiine oomi exceotions. No FPU enable bit is orovided. the FPU is alwavs enabled. 
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There are five possible FPU exceptions: Inexact (I), Underflow (F), Overflow (O), Division by Zero (Z). and Invalid 
Operauon (V). A sixth exception flag, FPU error (E) is also provided to allow the FPU to report other error conditions. In 
the FPU. only the V and Z exceptions are supported. 



Table 2: Floating Point Exception 1 


r lags 


Flap 




Support in this 
FPU 


E 


FPU error. 


No 


V 


Invalid Operation! 


Yes 


z 


Divide By Zero. 


Yes 


0 


Overflow. Value cannot be represented. 


No 


U i 


Underflow. Value cannot be represented. 


No 


I 


Inexact. The result cannot be represented. 


No 



The bits in Lhe cause field indicate the cause of exception during the execution of the current instruction. The cause bits 
are modified by execution of a floating point instruction (capable of causing exception). These bits are set to 0 or 1 
depending on occurrence or non-occurrence of exception conditions during the execution of the current instruction. It is 
possible for the FPU to set more than one bit in this field during the execution of a single instruction, if multiple exception 
conditions are detected. The bits in the enable field indicate the specific type of exceptions that are enabled to raise an 
exception (i.e., change of flow to an exception handling procedure). An exception is raised if the enable bit and the 
corresponding cause bit are set by the execution of the current instruction. The bits in the flag field. are used to capture the 
cumulative effect of all exceptions during the execution of a sequence of instructions. These bits, once set by an 
instruction can not be reset by following instructions. The bits in this field can only be reset by an explicit store operation 
on FPSCR. 

In FPU, each bit of cause EOUI, enable OUT, flag OUI and reserved field is predetermined to zero and the following fields 
are predetermined to the following values : 

RM = 01, indicating that rounding is always towards zero; (RZ mode) 

DN = 1 , indicating that denormalized source or destination operand will be flushed to zero. 

The predetermined values cannot be modified even by LDS instruction. 
16.4 Floating Point Exception Model 

16.4.1 Enabled Exception 

Both V and Z exceptions can be enabled by setting its enable bit. AU exceptions raised by the FPU are mapped onto the 
same CPU Exception Event. The semantics of the exception are determined by software by reading the system register 
FPSCR and interpreting the information maintained. there. 

16.4.2 Disabled Exception 

If enable bit V is not set, invalid operations produce qNaN as a result, (except FCM P/FTEST/NAN and FTRC) If enable 
bit Z is not set, a division by zero produces a correctly signed infinity. 

Overflow , underflow , and inexact exceptions do not appear as bit settings in either the cause or flag fields of the FPSCR. 
An overflow will produce the number whose absolute value is the largest represeniable finite number in the format with a 
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correct sign bit. An underflow will produce a correctly signed zero. If the result of an operation is inexact, the destination 
register will have the inexact result. 



16.4.3 Exception Event and Code for FPU 

All FPU exceptions are mapped onto the single general exception event FPU EXCEPTION, which is assigned exception 
code Ox 1 20. Loads and Stores raise the normal memory management general exceptions. 

16.4.4 Alignment of Floating Point Data in Memory 

Single precision floating point data is aligned on modulus 4 boundaries, that is, it is aligned in the same fashion as CPU 
long integers. 

16.4.5 Arithmetic with Special Operands conforms to CEEE 754 

All arithmetic with special operands (qNaN, sNaN/ +INF, -INF, +0. -0) follows IEEE 754 rules. These values are 
specified for each of the individual operations in this document. 

16.5 Synchronization with CPU 

16.5.1 Synchronization with CPU 

Floating-point operations and CPU operations are issued serially in a program order, but may complete out-of-order 
because the execution cycles are different. The floating point operation accessing only FPU resources does not require 
synchronization with CPU, and the following CPU operations can complete before the completion of the floating point 
operation. Therefore a proper program can hide the execution cycle of a long execution cycle floating point operation such 
as Divide. On the other hand, the floating point operation accessing CPU resources such as Compare requires the 
synchronization to ensure the program order. 

16.5.2 Floating Point Operations Requiring Synchronization 

Loads, Stores, Compares^ and operations accessing FPUL access CPU resources and require the synchronization 
Loads and Stores refer a general register. Post-increment loads and predecrement stores modify a general register 
Comparestesi modify T bit. The operations accessing FPUL refer or modify FPUL. These references and modifications 
are synchronized with CPU. , 

16.5.3 Maintaining Program Order on Exceptions 

Floating point operations never complete before the following CPU operations complete. FPU EXCEPTION is detected 
before the following CPU operations complete, and if FPU EXCEPTION is raised, the following operations are canceled. 
Therefore the program order is ensured even when FPU EXCEPTION is raised. 

During a floating point operation execution if a following operation raises an exception, the floating point operation is left 
execuung until FPU resources are accessed, and the access waits the completion of the floating point operation. Therefore 
the program order is ensured. 

16.6 Floating Point Instructions 

All floating point instructions are shown in Table 3, and new CPU Instructions Related to FPU are shown in Table 4. 
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Table 3: Floating Point Instructions 





Operation 


op code 


mnemonic 


Floating Move (Load) - 


FNM8 


FMQV.S @Rm,ERn 


Floating Move (Store) 


FNMA 


FMOV (r S FRrn. ©Rn 


Floating Move (Restore) 


FNM9 


■FMOV.S @Rm+,ERn 


Boating Move ^Save) 


FNMB 


FMOV.S FRm <3>-Rn 


Roatine Move (Load with tnd^) 


FNMfi 


FMOV.S ©fRO. Rm) H?n 


Floating Move fSmre wirh inda) - 


FNM7 


FMOV.S FRrn. (5>fR0 Rn^ 


Roating Move fin register file) 


FNMC 


FMOVFRm,ERn 


Roaring Load Immediate 0 


FN8D 


FLDIOFRn 


Roatine Load Immediate 1 


FN9D 


FLDIlERn 


Roating Add 


FNMO 


FADD ERm, £Rn 


Roating Subtract 


FNM1 


FSUB ERm, ERn 


Roating Multiply 


FNM2 


FMULERm, ERn 


Roating Divide 


FNM3 


FDIVERnuEEn 


Roating Multiply Accumulate 


FNME 


FMAC FRO, ERm, ERn 


Roating Compare Equal 


FNM4 


FCMP/EQ ERm, ERn 


Roating Compare Greater Than 


FNM5 


FCMP/GT ERm, ERn 


Floating Te$[ NaJSf 


FN7D 


FTST/NAN _ERr 


Roating Negate 
Roating Absolute Value 


FN4D 
FN5D 


FNEG ERn 
FABS ERn 


Roating Square Root 


FN6D 


FSQRT ERP 


Roating Convert from Integer 


FN2D 


FLOAT FPUL, ERn 


Roating Truncate and Convert to Integer i 


FN3D 


FTRC ERm, FPUL 




Roating Store from System Register FPUL 


FNOD 


FSTS FPUL, ERn 




Roating Load to System Register FPUL 


FN1D 


FLDS ERm, FPUL 




Table 4: New CPU Instructions Related to FPU 




Operation 


op code 


mnemonic 




Load from System Register FPUL 


4N5A 


LDS Rm, FPUL 




Restore System Register FPUL 


4N56 


LDSj^ @Rm+, FPUL 




Load from System Register FPSCR ' 


4N6A 


LDS Rm. FPSCR 




Restore S ystera Register FPSCR 


4N66 


LDSJU @Rm+, FPSCR 




Store to System Register FPUL 


0N5A 


STS FPUL. Rn 




Save System Register FPUL 


4N52. . 


STSiFPUU@.Rn 




Store to System Register FPSCR ! 


0N6A 


STS FPSCR. Rn 




Save System Register FPSCR 


4N62 


i FPSCR, @-Rn 
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16.6.1 Binary Operations 

Boating point addition, subtraction, multiplication and di 
style. 



vision are binary operations following the CPU architectural 



16.6.2 Comparisons 

tests Tor individuai colons or'Sffi^ff^^ - * ^ C ™ " d »™» 
and car be used there to control branching or to comoute en ZiZ f 1 ^ "* l ° SCt ** T " bit of ** CPU 
condition r J« ^ rTSTVNAN FPn PJass M ^ ry^ w Qe^nn^nc h» w M ^ 



16.6.3 Loads and Stores 

efficiently. And lnad< uJ gnd ex andX« " i s ? . a " 0W fl ° ating point values 10 «* P^ed and popped 



loads and stores are executed in the same way as long loads/stor e Z hS ™* mm r ™ b - Sin g [e P™cUion 

sourced (for a store) and written (for a load). ,0ads/store - T*** only m that the floating point register file is 



negate, are provided as unary operations to conserve 



instruction coding 



16.6.4 Other Operations 

Most other operations, for example floating point 
space. 

16.6.5 CPU Instructions for FPU 

16.6.6 Detailed Descriptions 

per iions are described in C language. The common C functions are shown below. 



#define CAUSE_V. 
#def ine CAUSE_Z 
^define ENABLE_V 
^define ENABLE_Z 
#def ine" FLAG_V 
#define FLAG Z 



. *def ine 
#define 
#def ine 
#define 
^define 
#def ine 
#def ine 



NORM 

PZERO 

NZERO 

PINF 

NINF 

sNaN 

QNaN 



0x00010000 
0x00008000 
0x00000800 
0x00000400 
0x00000040 
0x00000020 



O'xO /' 



/■* FPSCR (bit 16) 
/* FPSCR (bit 15) 
/* FPSCR (bit 11) 
/* FPSCR (bit 10) 
/* FPSCR (bit 6) 
/* FPSCR (bit 5) 



There is no special 
0x2 /*. meaning for this code 
0x3 /* assignment to the data 
0x4 /♦ types. 
0x5 /* 

0x6 /* This is just for . 
0x7 /* the description. 



*/. 

V 

V 

V 

*/ 

V 

V 
V 
V 
*/ 
V 
V 
V 
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"define EQ 0x0 /* There is no special ' V 

^define UO 0x1 /* meaning for this code V 

♦define GT 0x2 /* assignment. */ 

#define NOTGT 0x3 /* This is just for . V 

#define INVALID 0x4../* the description. V 

. • ' .1 

long . FPSCR; 
' int T; 

int load_Icng ( long *adress,* data) 

/* This function is defined in CPU part * / 
int store_long(long *adress , *data) 

■ /* This function is defined in CPU part */ 

int sign_of(long *src) 

{ 

return(*src >> 31); 
; ■ 
int data_type_of ( long *src) 

{. 

. float abs; 

abs = *src & 0x7fffffff ; 
'if (abs < 0x00800000) { 

if (sign_of (src) == 0) return ( P2ERO) ; 
^ else return (NZERO) ; 

else if ((0x00800000 <= abs) && {abs < 0x7f 800000 ) ) 

return (NORM) ;'. 

else if (0x7f800000 == abs) { 

if (sign^of (src) == 0) return (PINF) ; 
j else return(NINF) ; 

.else if (0x00400000 tabs) return ( sNaN) ; 
else " ' return (qNaN) ; 

}■ . 

clear_cause_VZ() { FPSCR (-CAUSE.V & ~CAUSE_Z> : ; } 

set_V() { FPSCR |= <CAUSE_V I FLAG V) • } 
set.ZO { FPSCR |= (CAUSE_Z | FLAG_Z )■ ; } , 

invalid(f loat *dest) 

{ . " . . 

set.VO; '" 

if ((FPSCR Sc ENABLED) == 0) qnan(dest); 

} , 

dz.( float *dest, int sign) ■ 1 

set Z ( ) ; 

^ if ((FPSCR & ENABLE J) == 0) inf (dest , sign) ; 
zero (.float *dest, int sign) 

if (sign == 0) *dest = 0x00000000; 

" else Mest = OxSOOOOOOoi - 

inf (float *dest, int sign) 
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if (sign == 0) »dest = 0x7f 800000; 

else *dest = 0xff800000; 

} 

qnan ( f loac *dest) 
{ 

*dest = 0x7£bf ffff ; 

} ' ' ■ ■' '.. 
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FABS (Floating Point Absolute Value): Floating Point Instruction 
Format Abstract Code 



Latency Pitch T bit 



FABS .FRn 



|FEn -> FRn 



llllnnnnOlOlllOl. 



Description: Takes floating-point-arithmetic absolute value of the content of floating point register £En. And the 
result of this operation is written on the £fin. 

Operation: 

FABS {float *£Rn) ■ /* FABS £Rn */ 
{ .. . 

clear_cause_VZ ( ) ; 
case (data_type_of (Ffin).) { 

NORM : if (sign_of (Ffin) == 0) *FRn = *£Rn; 

else *Ffin = - * FRn ; 

break; . s 



PZERO: 
NZERO: 
PINF : 
NINF : 
qNaN : 
■ sNaN : 

} 

pc += 2; 



zero (£Rn, 0) ; break; 

inf(£an,0) : ; ' break;, 
qnan (£Rn) ; break; 
invalid (£Sn) ; break; 



E£ja 


NORM 


+0 


-0 


+INF 


-INF 


qNaN 


sNaN 


FABS (£Rn) 


ABS 


+0 


+0 


+INF 


+INF 


qNaN 


Invaild 



Exceptions: 

Invalid operation 
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FADD (Floating Point Add): Floating Point Instruction 

f^U Abstract Code Latencv pugh Tbk 

FADD FRm, _F_Rn FRn+FRm -> FRn llllnnnnrnmrnmOOOO 2 l ., 

Description: Roating-point-arithmeucaily adds the contents of floating point registers FRm and FRn. And the result of 

this operation is written on the Efin. 
Operation: . 



FADD (float *E£m, *££n) 
{ 

clear_cause_VZ ( ) ; 
if ( (data_type_of (FRm) 
(data_type_of (££n) 
else if ( <data_type_of (Ffim) 
(data_type_of (F£n) 
else case (data_type_of (FRm) ) 
NORM : 

case (data_type_of (F£n) ) 



/* FADD £fim,£En. */ 



sNaN) 
sNaN) ) 
qNaN) 
qNaN) ) 
{ 



invalid (FRn) 
qnan(££n); 



,/ 



PINF 
NINF 
default 



{ 



} 



inf (Ffin, 0) ; 
inf (FRn, 1) ; 
*£En = *FRn 



breaks- 
break;- 



P2ERO: 

case (data_type_of (£En) ) 



NORM 

PZERO 

NZERO 

PINF 

NINF 



+ *FRm; 
break; 

{ 

+ ■ 



breaks- 



break; 



} 



zero (£Rn, 0) 
inf (ERn, 0) 
inf (££n, 1)' 



NZERO : 

case(data_type_of (£Rn) ) 



NORM 

PZERO 

NZERO 

PINF 

NINF 



} 



*£Rn = *F£n 
zero (FRn, 0) ; 
zero(££n, 1) ; 
inf (£En, 0) - 
inf (££n, 1) ; 



break; 
{ 

■+ *ISm; 



breaks- 
break; 
breaks- 



break; 



} 

PC 



PINF : 

case (data_type_of (flin,) ) 
NINF : invalid (£fin) ; 
default: inf(£Rn,0) 

} 

NINF : 

case (data_type_of (££n) ) 
PINF : invalid (ERn) ; 
default: inf(EEn,l). 

} 

+ ~ 2; 



break; 
{ 

break; 
{ 

breaks- 



break; 
breaks- 
break; 
breaks- 



break; 
breaks- 



break; 
break; 
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FADD Special Cases 





FEn 


.FRm 


NORM J 




-0 


+INF 


•INF 


qNaN 


sNaN 


JNURM 


ADD 








-INF 






. +0 




+0 












-0 






-0 










+ENF 








+ENF 


Invalid 






•INF 


-INF . 


Invalid 


-INF 






qNaN 




qNaN 




sNaN 




Invalid 



Denormalized value is treated as ZERO. . 



Exceptions: 

Invalid operation 



A3 6 



FCMP (Floating Point Compare): Floating Point Instruction 

Format ^stract Code • Latency Pi r.K X bit 

FCMP/EQ FRm,£En (ERn^EBm) 1 1:0 ->T llllnnnnmnmnOl^ 2 I TJT 

FCMP/GT FRm.FEn ( FRn> FRm) ? 1:0 ->T llllnnnnrnrnrnmOlOl 2 1 1/0 
Description: noating.point-ariihmetically compares between the contents of floating point registers £ftn and ££n 
And the result of this operation, true/false, is written on the T bit. 

Operation: 



FCMP_EQ( float »£Rm, « FRn) 



/* FCMP/EQ ZEm,_ESn V 



clear_cause_VZ ( ) ; 

if (fcmp_chk(£Em,£Rn) == INVALID) { 

else if(fcmp_chk(££m,££n) = = .EQ) 
else 
pc + = 2 ; 



fcmp_invalid{0) ; } 
T = 1; 

T = 0; 



} ' 

FCMP_GT( float * FRm, * FRn) 
{ 



./*■ FCMP/GT £Rm,££n */• 



clear_cause_VZ ( ) ; 

if (fcmp_chk(££m, F£n) == INVALID) . { 

else if <fcmp_chk(££m,E£n) == GT) 

else 

PC += .2; 

} : ■< 

fcmp_chk< float *£fim, *£Rn) 

{ 

if ( <data_type_of (£&n) == sNaN) | 
- (<iata_type_of (££n) == sNaN) ) 
else if ( (data_type_of (F£m) == .qNaN) | 
(data_type_of (££n) == qNaN) ) 
else case (data_type_of (FRm) ) { 



fcmp_invaiid{0) ; } 

T = 1; 
T = 0 ; 



NORM 



PZERO: 
NZERO : 



case (data_type_of (FRn) ) 
PINF : return (GT) ; 
NINF : return (NOTGT) 
default: 

} 

case (data_type_of (£Rn) ) 
PZERO : 

return (EQ) ; 
return (GT) ; 
return (NOTGT) 



return (INVALID) ; 
return (UO) ; . 
{ 



break; 
{ 



break; 
breaks- 
break; 



NZERO 
. PINF 
NINF 
default 



PINF 



NINF 



} 

case(data_type_of (£fin) ) 

PINF : return (EQ) ; 
^default: return (NOTGT) 

case (data_type_of (££n) ) 
NINF : return (EQ) 7 - 
default: return (GT) ; 

} * \ .... 

if (*£Rn==*FRm) return (EQ) ; 
else if (*£En>*£Em) return (GT); 
^ else return (NOTGT) ; 

fcmp_invalid( int cmp_flag) 



break; 
{ 



break 
break 
break 
break 



breaks- 
break; 



break; 
{ 



breaks- 



break; 
break; 
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set_V( ) ; 

if KFPSCR '& ENABLE_V) == 0) T = cmp_flag; 





FRn 1 


£Rm 


NORM 


+0 


-0 


+ENF 


•CNF 


qNaN 


sNaN 


NORM 


CMP 






GT 


! GT 






+o. 




EQ 










-0 












+INF 


! GT 


EQ 








-CNF 


GT 


EQ 






qNaN 




UO 




sNaN 














Invalid 



Deriormalized value is treated as ZERO. 



Exceptions: 

Invalid operation 



Note: EEE defines the independly 4 conditions of caparison. But FPU support FCMP/EQ and FCMP/GT only. But 
FPU can supprot all conditions using the conbination of BT/BF , FTST/NAN and these 2 FCMPs. 

ftst/nan£Em ; bf ; ftst/nan £Rn ; bf 



Unorder FRitl FR n 
(£Em==£fin) 
(ZRm != ££n) 
(BELm<£En ) 
(EEm <= £En) 
(FRm >F£n) 
(£fim>=££n) 



fcmp/eq Efim, ££n ; bt 
fcmp/eq FRm. FRn ; bf 
fcmp/gt £Rm, £En ; bt 
fcmp/gt ££n, ££m ; bf 
fcmp/gt £Rn, ££m ; bt 
fcmp/gt EEm t ZRn ; bf 
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FDIV (Floating Point Divide): Floating Point Instruction 
Format Abstract Code 



Latency Pitch T bit 



13 



12 



FDIV FRm, FRn ERn/EEm -> EEn llllnnnnrntninmOOll 

Description: noating-point-arithmeticaUy divides the content of floating point register £Rn by the content of floating 
point register ££m. And the result of this operation is written, on the ERn. 

Operation: 

FDIV { float *£Rm, *_ERn) /* FDIV ERm,_ERn V 



{ 



clear_cause_VZ { ) ; . 

if ( (data_type_of (EEjn) == sNaN) || 

(data_type_of (ERn) == sNaN) ) invalidlERn).; 
else" i£( <data_type_o£ (EBfli) = = qNaN) || 

(data_type_of (ERn) == qNaN) ) qnan(ERn) ; 
else case (data_type_o£ (E&n) ) ( 
NORM : 

case(data_type_pf (FRn) ) { 
PINF : 

NINF : inf (£En ( sign_of (£Em)"sign_of (EEn) ) ;, 

default: *££n = *£Rn / *£&n; 
y break; 

PZERO: 
NZERO: 

case (data_type_of (ERn) ) { 
PZERO : 

NZERO:. • invalid (ERn) ; 
default: dzfFRn. sian _o£ (ER*0 *sign_of (ERn) ) ; 
} break; 

. PINF : 
NINF : 

case (data_type_of (£Rn) ) { 
' PINF : 
NINF : invalid(ERn) ; 
default: zero (£Rn, sign_of (£Rm) ~sign_of (ERn) ) 
} break; 

) 

pc += 2; 



breaks- 
break;. 



break; 
break; 



break; 
break; . 



1 1 








FRn 






FRm 


NORM 


+0 


-0 


+INF -INF 


qNaN 


sNaN 


NORM 


DP/ 


0 


INF 






+0 


DZ 


Invalid 




DZ 






-0 














+INF 


0 


+0 


-0 


Invalid 






-INF 




-0 


- +0 








qNaN 










qNaN 




sNaN 












Invalid 



Denonnalized value is treated as ZERO. 



Exceptions: 

Invalid operation 
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FLDIO (Floating Point Load Immediate 0): Floating Point Instruction 

Format ' Abstract Code Latency Pitch T bit 

FLDIO FRn 0x00000000 '-> F£n llllnnnnlOOOHOl 2 1 ~ 

Description: Loads floating point zero (0x00000000) to the floating point register £Rn. 
Operation: 

FLDIO { float: *£fin) /* FLDIO £Rn */ 
{ 

*£Rn = 0x00000000; 
PC += 2; ■' . 

} 

Exceptions: 

None / 
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FLDI1 (Floating Point Load Immediate 1): Boating Point Instruction 

Fprmat Abstrac * Code Latency Pitch T hit 

FLDI1 EEn 0x3F800000 -> FRn llllnnnnlOOlllOl 2 1 7~ 

Description: Loads floating point one (0x3F8000OO) to the floating point register ERn. 
Operation: 

FLDIl (float *££n) /* FLDI1 ££n V 

{ . • ' ' .. 

*EEn = Ox3F800000; 

pc += 2; • . • . 

>• • ' ' , \ . '■ • • 

Exceptions: 
None 
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FLDS (Floating Point Load to System Register): Floating Point Instruction 

Format Abstract Code Latency Pitch T bit 

FLDS FRm, FPUL FPju -> FPUL UllnnnnOOOlllOl 2 1 

Description: Copies the concent of floating point register ERjn to the system register FPUL. 

Operation: 

'FLDS (float *£Bm, *F?UL) /* FLDS £Rm, FPUL */ 

{ - ' ' 1 . 

*FPUL = *F£m; .' , 

pc += 2; 

Exceptions:, • 

. None - 
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FMUL (Floating Point Multiply): Floating Point. Instruction 
Format Abstract Code 



Latency Pitch T bit 



FMUL FRm, FR n FRn * FRm -> FRn 



111 Innnn rnrnmm O 010 



Description: Roating-point-anthmetically multiplies the contents" of floating point registers ERm and EEn. And the 
result of this operation is written on the FRn . 

Operation: 

"FMUL (float *£fim,*_EEn) /*..' FMUL .EB^EBn .*/ 



{ 



} 

pc 



clear_cause_VZ ( ) ; 

if ( (data_type_of (££m) -= sNaN) || 

(data_type_of (E&n) == sNaN) ) invalid (EEn) ; 
else if ( (data_type_of (Efrn) == qNaN) || 

(data_type_of (ZRn) == qNaN)) qnan(ERn); 
else, case (data type of (FRm) ) .{ 
■ . NORM : ■ / , 

case (data type of (FRn)) { 
PINF. : 

NINF : inf (£Rn,sign_of (£Em) *sign_of (IRn) ) ; 
default: *££n = (*F£n).* (*££m); 

} . break; 

PZERO: 
NZERO : 

case (data_type_of (£Rn) ) {..'".'" 
PINF : ' • . 

•■ NINF : invalid (Efin) ; 

default: zero (£Rn/sign_of (£Bjn) ~sigri_of (£Rn) ) ; 

J break; 
PINF : 

NINF : " 

case (data_type of (FRn)) { 
PZERO: 

NZERO: invalid (££n) ; 
.default: inf (££n, sign_of (££m) "sign_of (£En) ) ; 

breaks- 



break; 
break; 



break; 
break; 



} 
2; 



break; 
break; 



FMUL Special Cases 







ERn 




NORM 


+0 


-0 


+INF 


-INF 


qNaN 


sNaN 


NORM 


MUL 


0 


INF 








+0 


0 


+0 


-0 


Invalid 








-0 




-0 


+0 










+ENF 


INF 


Invalid 




+INF 


-INF 






-INF 








-INF 


+INF 






qNaN 






qNaN 




sNaN 1 




Invalid 



Denormalized value is treated as ZERO. 



Exceptions: 

Invalid operation 
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FNEG (Floating Point Negate): Floating Point Instruction 

Format Abstract Code Latency Pitch T bit 

FNEG FRn . -FRn -> £Rn llllnnnnO'1001101 2 1 

Description: . Floating-point-anthmetically negates the content of floating point register ££n. And the result of this 
•operation is written on the £Sn. 

Operation: 

FNEG ( float '*£Sn) /> . FNEG £fin V 

{ . ' ■ " ■ ' -. ■ ■ 

clear_cause_VZ ( ) ; 

case (data_type_of <£fin) ) { 

qNaN : qnan(ERn) ; break;. 

sNaN : invalid(ERn) ; break; 
: default: *£Rn =. - ( *£fim) ; . break; 

} ; ■ ; " • " ' 

pc += 2 ; 



FNEG Special Gases 



ERn 


NORM 


+0 


-0 


+INF 


-INF 


qNaN 


sNaN 


FNEG(F_nj 


NEG 


-0 


+0 


-INF 


+1KF 


qNaN 


Invaild 



Denprmalized value is treated as ZERO. 



Exceptions: 

Invalid operation 



A44 



FSQRT (Floating Square Root): Floating Point Instruction 

Format Abstract . Code Latency Pitch T bit 

FSQRT FRn. ' " . WFRn -> FRn llllnnnnOllOllOl 13 12 

Takes floating-point-arithmetic square root of the content of floating point register ERn. And the result 
of this operation is written on the £&n. . ■■ 

FSQRT (float *£Rn) / * FSQRT ££n */ . 

{ : • ■ . • 

clear_cause_VZ ( ) ; 

case(data_type_of (£Rn) ) {. . ' .. " 

NORM : . if (sign_of (Efin) ==0) 

*£Sn = sqrt{*ESn) ;. ■ 
else invalid (ERn) ; break; ... 

■ PZERO: , 

NZERO: ■ ' ■ ' 

• PINF :'. *FRn = *F£n ; break; 

NINF : invalid(FRn) ; break; 

qNaN : . qnan(FEn); break; 

sNaN : . invalid (FRn) ; break; 

pc +- 2 ; . 



FSQRT Special Cases 



FRn 


+NORM 


-NORM 


+0 


-0 


+INF 


-INF 


qNaN 


sNaN 


FSQRT (FRn) 


SORT 


Invaild 


+0 


-0 


•+INF 


Invaild 


qNaN 


Invaild 



Denormalized value is treated as ZERO. 



Exceptions: 

Lnvalid operation 



Description: 
Operation: 
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FSTS (Floating Point Store from System Register): Floating Point Instruction 

Format ' Abstract Code • : Latency Pitch Tbit 

FSTS FPUL,F£n FPUL -> FRn llllnnnnOOOOHOl 2 1 

Description: Copies the content of the system register FPUL to floating point register ££n. 

Operation: 

FSTS {float *ERn,'*FPUL) / * FSTS FPUL , ££n */ 

*£Rn = . *FPUL; 

pc + = . 4 ; . 

} ' ". . • 

Exceptions: 
None 
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FSUB (Floating Point Subtract): Floating Point Instruction 
Format Abstract Code 



Latency Pitch T bit 



FSUB FRm, FR n FRn - FRm -> FR n 



111 lnxinn rnrnmrn O 00 1 



Description: 



Operation: 



Floating-point-arithmetically subtracts the content of floating point register FRm from the content of. 
floating point register ERn. And the result of this operation is written on the £Rn. 



FSUB (float *£Rm, * JERn) /* FSUB ERm,£Rn */ 
{ 



clear_cause_VZ ( ) ; 
i f ( ( da ta_type_o f { ERm) 
(data_type_of (ERn) 
else if { {data_type_of (£Rm) : 
(data_type_qf (ERn) 
else case (data type of ( FRm) ) 
NORM : 

case (data_type_of ( FRn) ) 



. PINF 
NINF 
default 



inf (FRn. 
inf (FRn, 



} 



■= sNaN) 
:= sNaN) ) 
« qNaN) 
:= qNaN) ) 
{ 

{ 

0) ; . 

1) ; 

- *F£m; 

break; 



invalid(£Rn) 

I 

qnan (ERn) ; . 



break; 
breaks- 
break; 



PZERO : 

case (data_type_of ( FRn) ) 



NORM 

PZERO 

NZERO 

PINF 

NINF 



*FRn = *£Rn 
zero ( FRn . 
zero (FRn, 
inf (FRn, 
inf (FRn. 



{ 

- *FRm; 
0) 
1) 
0) 
1) 



break ; 
break; 
breaks- 
break; 
break; 



} 



NZERO: 

case (data_type_of (FRn) ) 



break; 



{ 



NORM 

PZERO 

NZERO 

PINF 

NINF 



*£Rn = *ERn - *ERm; break; 



zero (ERn, 
inf (ERn, 
inf (£Rn, 



} 

pc 



} 

PINF : 

case (data_type .of ( FRn) ) 
NINF : . invalid (ERn) 
default: inf (FRn. 

} 

NINF : 

case (data type of(Ffrn) ) 
PINF : invalid(ERn) 
default: ' inf (FRn. 

} 

2; 



1) 



0) 



break; 
breaks- 
break; 



break; 

{ . 

break; 
{ 

breaks- 



break; 
breaks- 



break; 
break; 



A47 



FSUB Special Cases 





~?.n 








NORM 




•0 


-[NF 


-INF 


qNaN 


sNaN 


NORM 


SUB 




--ENF 


-[NF 






-0 


















•0 




-0 














-CNF 


-[NF 


Invalid 








■CNF 




Invalid 






qNaN 














/ qNaN 




sNaN 
















Invalid 



DenormaJized value is treated as ZERO. 



Exceptions: 

Invalid operation 
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FTRC (Floating Point Truncate and Convert to Integer): Floating Point Instruction 
Format Ab stract Code Latency Pitch T bit 



FTRC FEm, FPUL 
Description: 
Operation: 



(long)FSm -> FPUL llllnnnnOOllllOl 2 



Interprets the content of floating point register £Em as a floating point number value and truncates it to 
an integer value, And the result of this operation is written to the FPUL. 



#de£ine N_INT_RENGE 0xc7000000 /* -1.000000 * 2*31 */ 
#define P_INT_RENGE 0x46ffffff '/* l.fffffe * 2*30 V 

FTRC(float *£Bm, int *FPUL) . /* FTRC £Em f FPUL */ 



{ 



clear_cause_VZ ( ) ; 

case ( f trc_type_of (FRm) { 



, NORM 
PINF 
NINF 

} 

pc += 2; 



*FPUL = <long)'(*EBm) 
ftrc_invalid(0) ; / 
f trc_invalid ( 1) ; 



break; 
break; 
break; 



} 

int f trc_type_of (long *src) 

( - ; - - • 

long abs; 

abs = *src .& 0x7fffffff ; 
if (sign_of (sre) == 0) { 

if (abs > 0x7f800000 ) return (NINF) ; /* NaN */ 
else if (abs > P_INT_RENGE) return ( PINF) ; /* out of renge,+INF */ 
else "■- return (NORM) ; /* +0,+NORM */ 

} 

else " {• 

if (abs > N_INT_RENGE) return (NINF) 
else return (NORM) 



> 
} 

ftrc_invalid(long *dest, int sign) 
{ 

set.VO; 

if((FPSCR & ENABLE_V)- == 0){ 

if (sign ==0) *dest = OxTfffffff; 

else *dest = 0x80000000; 



/♦out of renge, +INF,NaN . */ 
/* -0, -NORM */ 



Ffin 


NORM 


+0 


-0 


positive 
out of 
range 


negaitive 
out of 
range 


+ENF 


-ENF 


qNaN 


sNaN 


FTRC 
(£fin) 


TRC 


0 


0 


Invaild 
+MAX 


Invaild 
-MAX 


Invaild 
+MAX 


Invaild 
-MAX 


. Invaild 
-MAX 


Invaild 
-MAX 



Denormalized value is treated as ZERO 



Exceptions: 

Invalid operation 
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FTST (Floating Point Test): Floating Point Instruction 

Format Abstract Code . Latency Pitch T bit 

FTST/NAN FRn (FRn==NAN) ? 1:0 ->T llllnnnnOlllllOl 2 .1. 1/0 

Description: Floaung-point-arithmetically tests which the contents of floating point register £Rn is NAN or not. And 
the result of this operation, true/false, is written on the T bit. 

Operation: 

FTST_NAN(f loat *£fin) /* FTST/NAN ££n */ 

clear_cause_VZ ( ) ; . 

case (data_type_o£ (EEn) ) { 

NORM : 

PZERO: 

NZERO: ^ 
PINF : . / ■ 

NINF : T = 0; "■ break; 

qNaN . : ■■ ' ■ T = 1; break; 
sNaN : f cmp_inval id ( 1 ) ; break; 

} " ' • ' ■ 

pc += 2 ; 

} 



FTST/NAN Special Cases 



FRn 


NORM 


+0 


-0 


+INF 


-INF 


qNaN 


sNaN 


FTST/NAN (ZBfi) 


T=0 


T=0 


T=0 


T=0 


T=0 


T=l 


Invaild 



Denormalized value is treated as ZERO. 



Exceptions: 

Invalid operation 
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LDS (Load to System Register): CPU Instruction 

Format Abstract Code Latency Pitch T bit 

1 LDS -Rm , FPUL Rm->FPUL OlOOnnnnOlOllOlO 1 1 

2 LDS^L @Rm+,FPUL @Rm->FPUL ,Rm+=4 OlOOnnnnOlOlOllO 2 1 

3 LDS Rm , FPSCR Rm->FPSCR OlOOnnnnOllOlOlO 1 1 \ - 

4 LDS^L @Rm+, FPSCR 8Rm->FPSCR, Rm+=4 OlOOnnnnOllOOllO 2 1 - 

- Description: 1 . Copies the content of general purpose register Rm to system register FPUL. 

2. Loads the content of the memory location addressed by general register Rm. And the result of this 
operation is written on system register FPUL. Upon successful completion, the value in Rm is 
incremented by 4. 

3. Copies the content of general purpose register Rm to system register FPSCR. The predetermined bits of 
FPSCR remain unchanged. 

4. Loads the content of the memory location addressed by general register Rm. And the result of this 
operation is written on system register FPSCR. Upon successful completion, the value in Rm is 
incremented by 4. The predetermined bits of FPSCR remain unchanged. 

Operation: . 

#def ine FPSCR_MASK 0x00018c60 

LDSdong ^Rm, *FPUL) 7-*. LDS Rm/FPUL */ 

{ , 
*FPUL = *Rm; 
pc += 2; 

} . ' . • •'. - . . 

LDS_RESTORE ( long *Rm, *FPUL) /* LDSJ^ @Rm+,FPUL */ 

{ , .. .. - - - 

if (load_long{Rm, FPUL) != Address_Error) *Rm += 4; 

pc += 2 ; 

>' • : l '• ; 

LDSClong *Rm, *FPSCR) 7 * LDS Rm, FPSCR * / 

{ ' . • . ■ ' • 

*FPSCR = *Rm-& FPSCR_MASK; 
PC +=2; ' 

LDS_RESTORE ( long *Rm, 'FPSCR ) /* LDS^ @Rm+ , FPSCR V 

{ . . . . 

long *tmp_FPSCR; 

if { load_long(Rm, tmp_FPSCR) ! = Address_Error ) { 
* FPSCR = *tmp_FPSCR & FPSCR_MASK; 
. *Rm +=4; 

} ., ; ' ■ . . 

pc += 2 ; 

} • • • " ■ . 

Exceptions: 

Address Error 
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STS (Store from System Register): CPU Instruction 
Format Abstract Code 



Latency Pitch T bit 



1 STS FPUL , Rn FPUL -> Rn 

2 STSX FPUL , @-Rn Rn-=4,FPUL.->@Rn 
.3 STS FPSCR, Rn FPSCR-> Rn 
4 STSJ* FPSCR, @-RnRn-=4,FPSCR->@Rn 



OlOOnnnnOlOllOlO 1 1 

OlOOnnnnOlOlOllO 2 1 

OlOOnnnnOllOlOlO 1 .1 

OlOOnnnnOllOOllO 2 1 



Description: 



1 . Copies the content of system register FPUL to general purpose register Rm. 

2. Stores the content of system register FPUL into the memory location addressed by general register Rn 
decremented by 4. Upon successful completion, the decremented value becomes the value of Rn. 

3. Copies the content of system register FPSCR to general purpose register Rm. 

4. Stores the content of system register FPSCR into the memory location addressed by general register 
Rn decremented by 4. Upon successful completion, the decremented value becomes the value of Rn. 



Operation: 



*FPUL,*Rn) 



/* STS FPUL, Rn V 



STS^L FPUL,(3-Rn */' 



STS { long 
{ 

*Rn = *FPUL; 
pc += 2 ; 
} ■ - 

STS_SAVE(long *FPUL, *Rn) 
( 

long *tmp_address = *Rn - 4; 

if (store_long(FPUL, tmp_address) • = Address_Error ) 
Rn = tmp_address; 
pc += 2; 

> 

STS (long *FPSCR, *Rn) 

{ - 

*Rn = *FPSCR; 
pc + = 2; 



/* STS FPSCR, Rn */ 



} 

STS_RESTORE( long *FPSCR, *Rn) /* 
{ 

long ■* tmp_address = *Rn - 4; 

if (store_loncr(FPSCR, tmp_address ) 
Rn = tmp_address; 
pc += 2; 

} . 



STS^L FPSCR, @-Rn */ 



Address_Error ) 



Exceptions: 

Address Error 
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FLOAT (Floating Point Convert from Integer): Floating Point Instruction 

Format Abstract Code Latency Pitch T bit 

FLOAT FPUL,ESn ( float) FPUL -> F£n UllnnnnOOlOllOl 2 1 

Description: Interprets the content of FPUL as an integer value and converts it to a tloating point number value. And 
the result of this operation is written on the floating point register FRn . 

Operation: 

FLOAT { int *FPUL, float *££n) /'* FLOAT ££n V 
t 

clear_cause_VZ ( ) ; 
*FRn = (float). *FPUL; 
pc + = 2; 

} 

Exceptions: / 
None 
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FMAC (Floating Point Multiply Accumulate ): Floating Point Instruction 

Format Abstract ] Code Latency Pitch Tbit 

FMAC FRO , FRm, FR n FRO*FRm+FRn -> F£n 111 lnnnnrnmmml 110 2 1 

Description: Roating-point-arithmeticaily multiplies the contents of tloating point, registers ffiO and ERm. And the 
result of this operation is accumulated to tloating point register ERn. 

Operation: 

FMAC(float *EfiO, *£Ba. *JERn) /* FMAC £R0,I£m,_I£n V 

{ - • . 

long tmp_FPSCR; 
float * tmp_FMUL - *£Em; 

FMUL (EfiO , tmp_FMUL) ; 

pc -= 2; /* correct pc */ 

tmp_FPSCR = FPSCR; /* save cause field for £B0*£Rm */ 
F ADD ( tmp_FMUL , F£n ) ; 

FPSCR 1= tmp_FPSCR; /* reflect cause field for ££0*£E^ */ 

1 FMAC Special Cases 















jn 




FRn 


PRn 


+NORM 


-NORM | 


+0 




+INF | 


-INF 


WORM 




MAC 








INF 




0 










Invalid 




+INF 


+INF 


-INF 


Invalid 




+INF 


-INF 




-INF 


-INF 


+INF 






-INF 


+INF 


+0 


NORM 


MAC 






INF 




0 








40 j 


Invalid 




■ +INF. 


+INF 


-INF 


Invalid 




+INF 


-INF 




-INF 


-INF 


+INF 






-INF 


+INF 


-0 


+NORM 


MAC 




+0 


-0 


+INF 


-INF 




-NORM 






-0 


+0 


-INF 


+INF 




+0 


+0 


-0 


+0 


-0 


Invalid 






-P 


-0 


+0 


-0 


+0 








+INF 


+INF 


-INF 


Invalid 




+INF 


-INF 




-INF 


-INF 


+INF 






-INF 


+INF 


+INF 


+NORM 


+INF 










Invalid 




-NORM 












+INF 




0 










Invalid 






+INF 






Invalid 




+INF 






-INF 


Invalid 


+INF 


I 






+INF 


-INF 


♦NORM 


-INF 








1 


-INF 




-NORM 
















0 










1 






+INF 


Invalid 




| Invalid 






-INF 




-INF 


-INF 








-INF 


Invalid 


qNaN 


0 










Invalid 


INF 






| Invalid 










! sNaN 














<-! NaN 


qNaN 














ail types 


sNaN 














sNaN 


all types 















qNaN 



sNaN 



Invalid 



De normalized value is treated as ZERO. 



Exceptions: 

Invalid operation 
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FMOV (Floating Point Move): Floating Point Instruction 

Format Abstract Code Latency Pitch T bit 

1 FMOV FRm, FRn FRm -> FRn llllnnnnnirnmrraiOO 2 1 - 

2 FMOV, S @Rm, FRn (Rm)-> FRn . llllnnnnrnmmmlOOO 2 1 - - ' 

3 FMOV^ ZEm, £Rm->(Rn) llllnnnnrnmrranlOlO 2 1 - 

4 FMOV . S @Rm+ , FRn (Rm) -> FRn; Rm+=aillnnruiinrnrraa001 2 . 1 

5 FMOV . S FRm, @-Rn Rn-=4, FRm-> (Rn llllnnnnrnmrnmlOll 2 - 1 

6 FMOV^£ @(R0^Rm) , FRn (RO+Rm) -> F£n UllnnnnmmmmOllO 2 1 

7 FMOV. S FRm,@(R0^Riu) FRm-> (RO+Rn) 111 InnnnniuiuimO 111 2 1 - 

Description: 1 . Moves the content of floating point register FBm to the floating point register ERn. 

2. Loads the content of the memory location addressed by general register Rm. And the result of this 
operation is written on the floating point register H£n. 

3 Stores the content of floating point register FRm into the memory .location addressed by general register 

/ ... - 

Rn. 

4. Loads the content of the memory location addressed by general register Rm. And the result of this 
operation is written on the floating point register ERn. Upon successful completion, the value. in Rm is 
incremented by 4. 

5. Stores the content of floating point register FRm into the memory location addressed by general register 
Rn decremented by 4. Upon successful completion, the decremented value becomes the value of Rn. 

6. Loads the content of the memory location addressed by general register Rm and RO. And the result of 
this operation is written on the floating point register ERn. 

7. Stores the content of floating point register FRm into the memory location addressed by general register 
Rn and RO. 

Operation: 

FMOV (float *£Em, *ERn) ./* FMOV^ E£m,ERn V , . 

{ 

. *ERn = *£Rm; 

pc += 2 ; 

} • • 

FMOV_LOAD(long t Rm, float *IRn) /* FMOV @Rm r £Rn */ 

{ ; . 

load long (Rm. FRn) ; 
pc += 2 ; 

} ■:■ ' • . - • • " ' - • '. . . . ..." 

FMOV_STORE( float * FRm. long *Rn) /* FMbV__S Efim, ®Rn */ 

{ 

store_long(£Em > Rn) ; 
pc += 2 ; 

} 

FMOV — RESTORE (long *Rm, float *ERn) /* FMOV . S, <3Rm+,££n */ 

. ( • ' • ' 

if f load long (Rm. FRn) 1= Address_Error ) *Rm +=4; 

'-"pc ■+ = 2; 

} \ 

FMOV_SAVE( float *JE&m, long *Rn) /* FMOV^ £Bm,G-Rn */ 

{ ' ' ' " 

long *tmp_address = *Rn - 4; 

if (store_long (£Em, tmp_address) ! = Address_Error ) Rn = tmp_address; 

pc += 2 ; 

} 

FMOV_LOAD_index(long *Rm, -long *R0, float *£Rn) /* FMOV^S <M RO^Rra) , ££n */ 

{ . .load_long(&(*Rm+*RO) ,FRn) ; 
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. pc += 2 ; . 

}' . . 

FMOV_STORE_index( float *E£m, Long *R0, long *Rn) /* FMOV^S £fim f 9 ( RO^Rn) */ 
{ . 

store_long(£Bm,M*Rn+*RO) ) 

pc *= .2 ; ■ 

• } ' . - . 

Exceptions: 

Address Error 
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2. CPU/FPU Interface Signals 



Name 


Phase 


. Width 


Direcdon 


s2_d 


ph2 


32 


S bus to FPU 


c2 sbrdv 


ph2 


1 


CCN to FPU 


t2_dbmiserr 


ph2 


I 


MMU to FPU 


u2_abrk 


ph2 


1 


UBC to FPU 


u2 brktyp 


ph2 




UBG to FPU 


s2_fstaU 


ph2 




CPU to FPU 


s2_dfcirsel 


ph2 


4 (1-hot) 


CPU to FPU 


sl_dfcirtseid 


phi 




CPU to FPU | 


[ s2_irthihl 


ph2 




CPU to FPU 


s2_irthill 


ph2 




CPU to FPU 


s2_fbypass 


ph2 




CPU to FPU 


s2_fcancel 1 


ph2 




CPU to FPU 


sl_fcancel2 


phi ' 1 




si Jin valid 


phi 




CPU to FPU 


f2_tbit 


ph2(El) 




FPU to CPU 


f2_busy 


ph2 (D) 




FPU to CPU 


f2_exc 


ph2(El) 




FPU to CPU 



Notes: 

1. s2_ffreezc may be necessary. (Any particular necessity has not yet been identified) 

2. This list only includes the CPU/CCN/FPU signals. It does not include signals such as reset, 12 Jrdy. etc. 

3. Due to lack of detailed information regarding SH3 CPU logic, we are not sure if all the above signals are real] 
necessary. Redundancy of signals will be eliminated as the simulation and validation proceeds. 



2.1 Signal Descriptions 

2.1.1 s2_d 



2.1.2 c2_sbrdy 

2.1.3 t2 tlbmiserr 



2.1.4 
2.1.5 
2.1.6 
2.1.7 
2.1.8 



u2_abrk 
u2J)rktyp 
s2_fstall 
s2_dfcirsel 
si dfcirtseld 



2.1.9 s2 irthihl 

2.1.10 s2_inhill 

2.1.11 s2_fbypass 



Carries the S-Bus data between the CPU, CCN, MAC and FPU. The FPU 
will monitor it during instruction fetches, floating point loads and LDS 
instructions that write to the FPUL register. It will put data on the bus during 
floating point stores and STS instructions that read the FPUL register. 
Indicates when the data on s2_d is ready. It is also used to indicate cache 
misses- 
Indicates that some form of a TLB exception has occurred This signal will tell 
the FPU when to cancel floating point operations that are in progress. 
Indicates that an address break has occurred 
Indicates the type of the address break. 

indicates a NOP needs to be introduced at the El -stage of the FPU pipe. 
4 bit one^hot signals from CPU to control I-Fetch databath 4 input Mux. 
signal to I-fetch dtapath to select the high or low of the I- word to be loaded in 
the instruction buffer. 

selection for the high order bytes of the instruction on the s2_d bus. 
selection for the low order bytes of the instruction on the s2_d bus. 
Tells the FPU which data to forward to each instruction, (see section 3.8) 



2.1.12 si_fcancel2 Tells the FPU to cancel the instructions in its I, D t El and E2 stages. 

2.1.13 si Jinvalid Tells the FPU to ignore the instructions on the s2_d bus. 
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Condition code generated by the floating point compare instructions. 
Indicates that the FPU is executing a floating point divide or square root. 
Indicates that there was an FPU exception. 
Tells the FPU to cancel the instruction in I and D stages 



2.2 Signals that should be synthesized at the FPU end 

Some signal names are used in this documentation that are not directly available as interface signals. 
We assume that these signals will be formed at the FPU end from the available interface signals. 

2.2. 1 s2_ifetch OR ( s2_irthihl , s2_irthill ) , signal indicates instruction fetch is in progress. 



2.1.14 f2 tbit 

2.1.15 f2_busy 

2.1.16 f2_exc 

2.1.17 s2 fcancell 
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3.8 FPU Bypass Cases r , 

To improve its instruction throughput, the FPU supports register forwarding jot bypass 

paths are used, t ip- 

1. S-bus (data bus v 222) to El input (Fra) 

2. S-bus to El input (Fn) 

3. S-bus to E2 input 

4. E2 output to E2 input 

5. E2 output to El input (Fn) 

6. E2 output to El input (Fm) 

7. E2 output to Fmac input (FO) 

8. S-bus to Fmac input (FO) 

Four bit signal s2_fbypass from the CPU controls bypass at the FPU end. The encoding 
is as follows: 

s2_fbypass[4]:bypass to Fmac input (FO) 
s2_fbypass[3]:bypass to E2 input i 
s2 fbypass[2]:bypass to El input (Fn) 
s2~fbypass[l]:bypass to El input (Fm) 

... " ■ . / . ■ 

s2_fbypass[0] = l:bypass from S-bus 
s2_fbypass[0] = Orbypass from E2 

3.8.1 Bypass from Ei output to E2 input 



FADD FR3.FR1 



FMOV.SFRI.@Rn 



I 


D 


El 


E2 4 s 






I 


D 






s 



Bypass Signal 
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3JL2 Bypass from E2 output to Rn or Rxn input of Ei 

FADD FR3, FR1 
FADD FR1, FR4 



I 



Stall Signal 
Bypass Signal 



D 



El 



I 



D 



D 



E2 S 
-AL_ 

2L^ 



El 



J l_ 

'_T~L 



E2 



(NOP) 



3.9.1 Bypass from data bus 222 to Rn or Rm input of E, 



FMOV.s@Rm,FR 
FADD FR1,FR2 



I 


D 


El 


E2 

— \ 


S 










I 


D 


X 




X 


(NOP) 








D 


El 


E2 


S 





Stall Signal 
Bypass Signal 
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